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SEQUENCING OP SURFACE IMMOBILIZED POLYMERS 
UTILIZING MICROFLDORESCENCE DETECTION 

5 . . " 

BACKGROUND OF THE INVENTION 

The present invention relates to the determination of 
the sequences of polymers immobilized to a substrate, in 
particular, one einbodlment of the invention provides a method 

10 and apparatus for sequencing many nucleic acid sequences 
immobilized at distinct locations oh a matrix surface. The 
principles and apparatus of the present invention may be used, 
for example, also in the determination of sequences of 
peptides, polypeptides, oligonucleotides, nucleic acids, 

15 oligosaccharides, phospholipids and other biological polymers. 
It is especially useful for determining the sequences of 
nucleic acids and protieins. 

The structure and function of biological molecules are 
closely interrelated. The structure of a biological polymer, 

20 typically a macromolecule, is generally determined by its 
monomer sequence. For this reason, biochemists historically 
have been interested in the sequence characterization of 
biological macromolecule polymers. With the advent of 
molecular biology, the relationship between a protein sequence 

25 emd its corresponding encoding gene sequence is well 

understood. Thus, characterization of the sequence of a 
nucleic acid encoding a protein has become very important. 

Partly for this reason, the development of 
technologies providing the capability for sequencing enormous 

30 amounts of DNA has received great interest. Technologies for 
this capability are necessary for, for example^ the successful 
completion of the human genome sequencing project. Structural 
characterization of biopolymers is very important for further 
progress in many areas of molecular and cell biology. 

35 While sequencing of macromolecules has become 

extremely is^ortant, many aspects of thes technologies have 
not advanced significantly over the past decade. For example, 
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in the protein sequencing technologies being applied today the 
Edman degradation methods . are still being used. See, e.g, , 
Knight (1989) "Microsequencers for Proteins and 
Oligosaccharides," R^o/Teehnol, 7:1075-1076. Although advanced 
5 instrumentation for protein sequencing has, been developed, S£e, 
e.g., Frank et al . (1989) "Automation of DNA Sequencing 
Reactions and Related Techniques: A Work Station for 
Micromanipulation of Liquids," Bio/Technol. 6:1211-1213, this 
technology utilizes a homogeneous and isolated protein sample 
10 for determination of removed residues from that homogeneous 
sample^ 

Likewise, in nucleic acid sequencing technology, three 
major methods for sequencing have been developed, of which two 
are commonly used today, see , e.g., Sambrook et al « (1989) 

15 Molecular cloning; A Labor al^orv Manual (2d Ed,) Vols. 1-3, 
Cold Spring Harbor Press, New York, which is hereby 
incorporated herein by reference. The first method was 
developed by Maxam and Gilbert- See, e.g., Maxam and Gilbert 
(1980) "Sequencing End-Labeled DNA with Base-Specific Chemical 

20 Cleavages," MeiAods ±n Enzvmol, 65:499-560, which is hereby 

incorporated herein by reference. The polymer is chemically 
cleaved with a series of base-specific cleavage reagents 
thereby generating a series of fragments of various lengths. 
The various fragments, each resulting from a cleavage at a 
25 specific base, are run in parallel on a slab gel which resolves 
nucleic acids wbich differ in- length by single nucleotides. A 
protein specific label allows detection of cleavages at all 
nucleotides relative to the position of the label. 

This separation requires high resolution 
30 electrophoresis or some" other system for separating nubleic 
acids of very similar size- Thus, the target nucleic acid to 
be sequenced must usually be initially purified to near 
homogeneity. 

Sanger and Coulson devised tw alternative methods for 
35 nucleic acid sequencing- The first method, known as the plus 
and minus method, is described in Sanger and Coulson (1975) 
Hol. Biol. 94:441-448, and has been replaced by the second 
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method. Subsequently, Sanger and Coulson developed another 
improved sequencing method known as the dideoxy chain 
termination method. See . e,g., Sanger et al , (1977) "dna 
Sequencing with Chain-Termination Inhibitors," Proc. Natl. 
5 Acad. Sci. PSA 74:5463-5467, which is hereby incorporated 

herein by reference. This method is based on the inability of 
2', 3' dideoxy nucleotides to be elongated by a polymerase 
because of the absence of a 3 * hydroxyl group on the sugar 
ring, thus resulting in chain termination. Each of the 
10 separate chain terminating nucleotides are incorporated by a 
DNA polymerase, and the resulting terminated fragment is known 
to end with the corresponding dideoxy nucleotide. However, both 
of the Sanger and Coulson sequencing techniques usually require 
isolation and purification of the nucleic acid to be sequenced 
15 and separation of nucleic acid molecules differing in length by 
single nucleotides. 

Both the polypeptide sequencing technology and the 
oligonucleotide sequencing technologies described Eibove suffer 
from the requirement to isolate and work with distinct 
20 homogeneous molecules in each determination. 

In the polypeptide technology, the terminal amino acid 
is sequentially removed and analyzed. However, the analysis is 
dependent upon only one single amino acid being removed, thus 
requiring the polypeptide to be homogeneous. 
25 In the case of nucleic acid sequencing, the present 

techniques typically utilize very high resolution 
polyacrylamide gel electrophoresis. This high resolution 
separation uses both highly toxic acrylamide for the separation 
of the resulting molecules and usually very high voltages in 
30 running the electrophoresis. Both the pxirification and 

isolation techniques are highly tedious, time consuming and 
expensive processes. 

Thus, a need exists for the capability to 
simultane usly sequence many biological polymers without 
35 individual isolation and purification. Moreover, dispensing 
with the need to individually perform the high resolution 
separation of related molecules leads to greater safety, speed, 
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and reliability. The present invention solves these and many 
other problems. 

StFMHARY OP THE INVENTION 

5 The present invention provides the meEais to sequence 

hundreds/ thousmds or even millions of biological 
macromolecules . simtiXtaneously and without indivixiually 
isolating each macromolecule to be sequenced. It also 
dispenses with the requirement, in the case of nucleic acids, 
10 to separate the products of the sequencing reactions on 

(iangerous polyacrylamlde gels. Adaptable to automation, the 
cost and effort required in sequence analysis will be 

dramatically reduced. 

This invention is most applicable, but not limited, to 
15 linear macromolecules. It also provides specific reagents for 
sequencing botli oligonucleotides and polypeptides- It provides 
an apparatus for automating the processes described herein. 

The present invention provides methods for determining 
the positions of polymers i^ich terminate with a given monomer, 
20 where said polymers are attached to a stirface having a 

plurality of positionally distinct polymers attached thereto, 

said method comprising the steps of: 

labeling a terminal monomer in a monomer type 

specific manner; and 

25 scanning said surface, thereby determining the 

positions of said label. In one embodiment, the polymers are 
polynucleotides, and usually the labeling of the terminal 
marker con^rises incorporation of a labeled terminal monomer 
selected from the group of nucleotides consisting of adenine, 

30 cytidine, guanidine and thymidine. 

An alternative embodiment provides methods for 
concurrently determining which subset of a plurality of 
positionally distinct polymers attached to a solid substrate at 
separable locations terminates with a given terminal subunit, 

35 said method comprising the steps of: 

mixing said solid substrate with a solution 
comprising a reagent, yhich selectively marlcs positionally 
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distinct polymers which terminate with said given terminal 
suhunit; and 

determining with a detector which separable 
locations are marked, thereby determining which subset of said 
5 positionally distinct polymers terminated with said given 
terminal subunit. In one version, the solution comprises a 
reagent which marks the positionally distinct polymer with a 
fluorescent label moiety. In another version the terminal 
subunit is selected from the group consisting of adenosine, 
10 cytosine, guanosine, and thymine. 

Methods are also provided for determining which subset 
of a plurality of primer polynucleotides have a predetermined 
oligonucleotide, wherein the polynucleotides are complementary 
to distinctly positioned template strands which are attached to 
15 a solid substrate, said method comprising the steps of: 

selectively marking said subset of primer 

polynucleotides having the predetermined oligonucleotide; and 

detecting which polynucleotides are marked. In 

one embodiment, the oligonucleotide subunit is a single 
20 nucleotide; in another the marking comprises elongating said 

primer with a labeled nucleotide which is complementary to a 

template; and in a further embodiment the marking step uses a 

polymerase and a blocked and labeled adenine. 

The invention embraces methods for concurrently 
25 obtaining sequence information on a plurality of 

polynucleotides by use of a single label detector, said method 

comprising the steps of: 

attaching a plurality of positionally distinct 

polynucleotides to a solid substrate at separable locations; 
30 labeling said plurality of polynucleotides with a 

terminal nucleotide specific reagent, said label being 

detectable using said label detector; 

determining whether said specific labeling 

reagent has labeled each separable location. Often, the 
35 labeling is performed with reagents which can distinguishably 

label alternative possible nucleotide monomers. One embodiment 

uses four replica substrates each of which is labeled with a 
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specific labeling reagent for adenine, cytosine, guanine,, or 
thymine. Usually, the labeling and determining steps are 
performed in succession using reagents specific for each of 
adenine, cytosine, guanine, and thymine monomers. 

An alternative embodiment provides methods for 
concurrently obtaining sequence information on a plurality of 
polynucleotides, said method comprising the steps of: 

attaching distinct polynucleotides to a plurality 

of distinct solid substrates? 

labeling said plurality of solid substrates with 
a terminal nucleotide specific labeling reagent; and 

determining vrtiether said specific labeling 
reagent has labeled each distinct substrate. The method can be 
performed using in a continuous flow of distinct solid 
15 substrates through a reaction solution. 

A method is provided for simultaneously sequencing a 
plurality of polymers made up of monomer units, said plurality 
of polymers attached to a substrate at definable positions, 
said method comprising the steps of: 
20 mixing said substrate with a reagent which 

specifically recognizes a terminal monomer, thereby providing 
identification among various, terminal monomer units; and 

scanning said substrate to distinguish signals at 
definable positions on said substrate; and 
25 correlating said signals, at defined positions on 

said substrate to provide sequential series of sequence 
determinations. Often, the plurality of polymers are 
synthesized by a plurality of separate cell colonies, and the 
polymers may be attached to said substrate is by a carbonyl 
lihJcage. In one embodiment, the polymers are polynucleotides, 
and often the substrate coi[5>rises silicon. The scanning will 
often identify a fluorescent label. In one embodiment, the 
reagent eschibits specificity of removal of terminal monomers, 
in another, the reagent exhibits specificity of labeling of 
35 terminal monomers. 
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The invention also embraces me'thods for sequencing a 
plurality of distinctly positioned polynucleotides attached to 
a solid siibstrate comprising the steps of: 

hybridizing complexaentary primers to said 
5 plurality of polynucleotides; 

elongating a coaplenentary primer hybridized to a 
polynucleotide by adding a single nucleotide; and 

identifying which of said complementary primers 
have incorporated said nucleotide. In some versions, the 

10 elongating step is performed simultaneously on said plurality 
of polynucleotides linked- to said sijbstrate. Typically, the 
substrate is a two dimensional surface Eurid the identifying 
results from a positional determination of the complementary 
primers incorporating the single defined nucleotide. A silicon 

15 substrate is useful in this method. 

Methods, are provided where the linking is by 
photocross linking polynucleotide to said complementary primer, 
where said primer is attached to said substrate. The 
elongating will be often catalyzed by a DNA dependent 
20 polymerase. In various embodiments, a nucleotide will have a 
removable blocking moiety to prevent further elongation, e.g., 
NVOC. 

A nucleotide with both a blocking moiety and labeling 
moiety will be often tised. 
25 A further understanding of the nature and advantages 

of the invention herein may be realized by reference to the 
remaining portions of the specification and the attached 
drawings* 

30 BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 illustrates a simplified and schematized 
embodiment of a degradative scheme for polymer sequencing. 

Fig. 2 illustrates a simplified and schematized 
embodiment of a synthetic scheme for polymer sequencing. 
35 Fig. 3 illustrates a coordinate mapping system of a 

petri plate containing c lonies. Bach position of a col ny can 
be assigned a distinct coordinate position. 
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Fig. 4 illustrates various modified embodiments of the 
substrates . 

Fig. 5 illustrates an idealized scanning result 
corresponding to a particular colony position. 
5 Fig. 6 illustrates particular linkers useful for . _ 

attaching a nucleic acid to a sUicon substrate. Note that 
thymine may be substituted by adenine, cytidine, guanine, or 
uracil • 

Fig. 7 iUustrates an embodinent of the scanning 

lb system and reaction chamber. 

Pig. 8 illustrates the application of the synthetic 
scheme for sequencing as applied to a nucleic acid cluster 
localized to a discrete identified position. Fig. 8i 
illustrates schematically, at a molecular level, the sequence 
of events which occur during a particular sequencing cycle. 
Fig. 8B illustrates, in a logic flow chart, how the scheme is 

performed. — . . . 

Fig. 9 illustrates the synthesis of a representatxve 

nucleotide analog useful in the synthetic scheme. Note that 
20 the FMOC may be attached to adenine, cystosine, or guanine. 

Fig. 10 illustrates the application of the degradative 
scheme for sequencing as applied to a nucleic acid cluster 
localized to a discrete identified position. Pig. lOA 
illustrates schematically, at a molecular level, the sequence 
25 of events which occur during a particular sequencing cycle. 
Fig. lOB illustrates in a logic flow chart how the scheme iS 
performed. 

Fig. 11 illustrates a functionalized apparatus for 
performing the scanning steps and sequencing reaction steps. 



15 
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DETAILED DESCRIPTION OF THE FRETERRED EMBODIMENTS 

I. Sequencing Procedure for a Generic Polymer 

A. Overview 

1. Substrate and matrix 

2. Scanning system 

3. Synthetic/ degradative cycles 

4. Label 

5. Utility 

B. Substrate/Matrix 

1 , Non-distortable 

2. Attachment of polymer 

C. Scanning system 

1. Mapping to distinct position 

2. Detection system 

3. Digital or analog signal 

D. Synthetic or degradative cycle 
1* Synthetic cycles 

a, synthetic scheme 

b. blocking groups 

2. Degradative cycles 

3, Conceptual principles 

E. Label 

1. Attachment 

2. Mode of detection 

F. Utility 



II* Specific Embodiments 
A, Synthetic method 
30 B. Chain degradation method 



III. Apparatus 



* * * It * 



I. Semiencina Procedure for a Generic Polvmer 

The present invention provides methods and apparatus 

40 for the preparation, and use of a substrate having a plurality 
of polymers with various sequences where each small defined 
contiguous area defines a small cluster of homogeneous polymer 
sequences. The invention is described herein primarily with 
regard to the sequencing of nucleic acids but may- be readily 

45 adapted to the sequencing of other polymers, typically linear 
biological macromolecules. Such polymers include, for example, 
both linear cyclical polymers or nucleic acids, 
polysaccharides, phospholipids,, and peptides having various 
different amino acids, heteropolymers in which the polymers are 



10 



15 



20 



25 
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mixed, poXyurethanes, polyesters, polycarbonates, polyureas, 
polyamides, polyethyleneimines, polyarylene sulfides, 
polysiloxanes, polyimides, • polyacetates or mixed polymers of 
various sorts. In a preferred embodiment, the present 
invention is described in the use of sequencing nucleic acids. 

various aspects of PCT publication no, W090/15070, 
published December 13, 1990; PCT publication no. WO91/07087, 
published Kay 30, 1991; O.S.S.N. 07/624,120, filed December 6, 
1990; and U.S.S.N. 07/624,114, filed December 6, 1990; each of 
^rtiich is hereby incorporated herein by reference, are 
applicable to the substrates and matrix materials described 
Herein, to the apparatus used for scanning the matrix arrays, 
to means for automating the scanning process, and to the 
linkage of polymers to a substrate. 

A. overview 

— The present invention is based, in part,, on the 

ability to perform a step wise series of reactions which either 
extend or de^de a polymer by defined units. 

Figure 1 schematizes a simplified linear two monomer 
polymer made up of A type and B type subunits. A degradative 
scheme is illustrated. Panel A depicts a matrix with two 
different polymers located at positions lo and 14, but with no 
polymer linked at position 12. A reaction is employed to label 
all of these polymers at the terminus opposite the attachment 
of the monomer. Panel B illustrates a label (designated by an 
asterisk) incorporated at position 16 on the terminal monomers. 
A scan step is performed to locate positions 10 and 14 where 
polymers have been linked, but no polymer is located at 

, position 12. The entire matrix is exposed to a regent which is 
specific for removing single terminal A monomers, which are 
also labeled. The regent is selected to remove only a single 
monomer; it will not remove further A monomers. Removal Of the 
labeled A monomer leaves a substrate as illustrated in panel C. 

i A scan step is performed, and compared with the previous scan, 
indicates that the polymer located at position 12 has lost its 
label, i.e, that polymer at 12 terminated with an A mononer. 
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The entire matrix is then exposed to a second reagent which is 
specific for removing terminal B monomers which are also 
labeled. Note that only a single B on each monomer is removed 
and that successive B monomers are not affected. Removal of 
5 the labeled B monomer leaves a substrate as illustrated in 
panel D. Another scan step is performed, indicating that the 
polymer located at position 14 has lost its label, i.e., it 
terminated with a B monomer • The sequence of treatments and 
scans is repeated to determine the successive monomers. It 

10 will be recognized that if the labeled A and B are 

distinguishable, i.e., the label on polymers at sites 10 and 14 
may be distinguished, a single removal step can be performed to 
convert the substrate as illustrated in panel B directly to 
that illustrated in panel D. 

15 An alternative embodiment employs synthetic reactions 

where a synthetic product is made at the direction of the 
attached polymer. The method is useful in the synthesis of a 
complementary nucleic acid stremd by elongation of a primer as 
directed by the attached polymer. 

20 Pigure 2 illustrates a similar simplified polymer 

scheme, where the A and B monomer provide a complementary, 
correspondence to A' and B* respectively. Thus, an A monomer 
directs synthetic addition of an A* monomer and a B monomer 
directs synthetic addition of a B' monomer. Panel A depicts 

25 monomers attached at locations 18 and 22, but not at location 
20. Each polymer already has one corresponding complementary 
monomer A' . The matrix, with polymers, is subjected to sm 
elongation reaction which incorporates, e.g. , single labeled A' 
monomers 24 but not B* monomers, as depicted in panel B. The 

30 label is indicated by the asterisk. Note that only one A 
monomer is added. A scan step is performed to determine 
Whether polymers located at positions 18 or 22 have 
incorporated the labeled A' monomers. The polymer at position 
18 has, while the polymer at position 22 has not. Another 

35 elongation reaction which incorporates liabeled B* monomers 26 
is performed resulting in a matrix as depicted in panel C. 
Again note that only one, and not successive B» monomers, is 
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added, tootber scan is performed to determine whether a 
polymer located at sites 18 or 22 has incorporated a ^^^eled B 
loZer, and the result indicates that the PO^V^er^^^^^^^^ 
aite 22 has incorporated the labeled B' monomer. A f^^. 
5 removes all of the labels to provide a substrate as depicted. in 
panel D. AS before, if the polymer which incorporated a 
labeled A' monomer is distinguishable from a polymer wh^ch 
incorporated a labeled B- monomer, tbe separate elongatxon 
reactions may be combined producing a panel C type matrix 

XO directly from a panel A type matri. and the scan procedure can 
distinguish which terminal monomer was incorporated. 

It will be appreciated that the process may be applied 
more complicated polymers having more different types^of 
monomers. Also, the number of scan steps can ^^J^^^Jf 

15 the various possible labeled monomers can be differentiated by 

the detector system. 

Typically, the units will be single monomers, though 
under certain circumstances the units may comprise dlmers, 
ZZrs. or longer segments of defined length. Xn fact, under 
.0 S^aL Circumstances, tbe method may be operable^ removxn. 
or adding different sized units so long as the units are • 
distinguishable. However, it is very 

reaqents used do not remove or add successive monomers. This 
ins ved in tbe dc^adative method by use of highly spec^.c 
35 reagents. In the synthetic mode, this is often achieved with 
removable blocKing groups which prevent further elongation. 

one important aspect of the invention is the concept 
o. using a substrate bavin, bomo^eneous^us^ers poXy«srs 

— Which - 

single sequence. For eacarapJ-B, « * j 4. 

the reproducible identification and correlation of data 
^rlsUi^. - -stinct homogeneous --^ers^f^ y»« 
« locally attached to a matrix surface. Fig. 3 illustrates a 

" ^^iig - ^ -"^"""r: ^s^" 

of polymers from a producing colony of organises to a Batrxx 
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preserves spatial information thereby allowing positional 
identification. The positional identification allows 
correlation of data from successive scan steps. 

In one embodiment, bacterial colonies producing 
5 polymers are spatially separated on the media surface of a 
petri plate as depicted in panel A. Alternatively, phage 
plagues on a bacterial lawn can exhibit a similar distribution. 
A portion of panel A is enlarged and shown in panel B. 
Individual colonies are labeled C1-C7 . The position of each 
10 colony can be mapped to positions on a coordinate system, as 
depicted in panel C. The positions of each colony can then be 
defined, as in a table shown in panel D, which allows 
reproducible correlation of scan cycle results. 

Although the preferred embodiments are described with 
15 respect to a flat matrix, the invention may also be applied 

using the means for correlating xietection results from multiple 
samples after passage through batch or continuous flow 
reactions. For example, spatially separated polymers may be 
held in separate wells on a microtiter plate. The polymers 
20 will be attached to a substrate to retain the polymers as the 
sequencing reagents are applied and removed. 

The entire substrate surface, with homogeneous 
clusters of polymer attached at defined positions, may be 
subjected to batch reactions so the entire surface is exposed 
25 to a uniform and defined sequence of reactions. As a result, 
each cluster of target polymers for sequencing will be 
subjected to similar reactive chemistry. By monitoring the 
results of these reactions on each cluster localized to a 
defined coordinate position, the sequence of the polymer which 
30 is attached at that site will be determined. 

Fig. 4. panel A illustrates solid phase attached 
polymers linked to particles 32 ^ich are individually 
sequestered in separate wells 34 on a microtiter plate. The 
scanning system will separately scan each well. Pig. 4 panel B 
35 illustrates marbles 36 to which polymers are attached. The 
marbles are automatically fed in a continuous stream through 
the reaction reagents 38 and past a detector 40. The marbles 
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nay be carefiilly held in tubes or troughs which prevent the 
order of the beads from beuig disturbed. In a combination of 
the two eabodiaents, each polymer is attached to a pliirality of 
small marbles, and marbles having each polymer are separated, 
but retained in a knovn order. Each marble is, in batch with a 
number of analogous marbles having other polymers linked 
individually to them, passed through a series of reagents in 
the sequencing system. For example, &2, B2, and C2 are 
subjected to sequencing reactions in batch, with label 
incorporated only for the second monomer A3, B3, and C3, are 
likewise treated to determine the third nonomer. Likewise for 
A , B , and C . However, within each batch, the detection will 
usually occu^ in the order A, B, and C, thereby providing for 
correlation of successive detection steps for the A polymer 
beads, for the B polymer beads, and for the C polymer beads. 

Fig. 5 illustrates a signal, wbich might result from a 
particular defined position. Panel A illustrates the_position 
of a given colony relative to the positions corresponding to 
the positional map. The scan system will typically determine 
the amount of signal, or type of signal, at each position of 
the matrix. The scan system will adjust the relationship of 
the detector and the substrate to scan the matrix in a 
controllable fashion. An optical system with mirrors or otber 
elements may allow the relative positions of the substrate and 
detection to be fixed. The scanner can be programmed to scan 
the entire substrate surf ace in a reproducible manner, or to 
scan only those positions where polymer clusters have been 
localized. A digital data map, panel B, can be generated from 
the scan step. 

, Thus, instead of subjecting each individual and 

separated polymer to the series of reactions as a" homogeneous 
sample, a whole matrix array of different polymers targeted for 
sequencing may be exposed to a series of chemical manipulations 
in a batch format. A large array of hundreds, thousands, or 

i even miUions of spatially separated homogeneous regions may be 
simultaneously treated by defined sequencing chemistry. 
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The use of a coordinate system which can reproducibly 
assay a defined position after each reaction cycle can be 
advantageously applied according to this invention. For 
example, a colony plaque lift of polymers can be transferred 
5 onto a nitrocellulose filter or other substrate. A scanning: 
detector system will be able to reproducibly monitor the 
results of chemical reactions performed on the target polymers 
located at the defined locations of particular clones. An 
accurate positioning can be further ensured by incorporating 

10 various alignment - marlts on the substrate. 

The use of a high resolution system for monitoring the 
results of successive sequencing steps provides the possibility 
for correlating the scan results of each successive sequencing 
reaction at each defined position. 

15 The invention is dependent, in part, upon the stepwise 

synthesis or degradation of the localized polymers as 
schematized in Figures 1 and 2. The synthetic scheme is 
particularly useful on nucleic acids which can be synthesized 
from a complementary strsmd. Otherwise, a stepwise degradation 

20 scheme may be the preferred method. Although single monomer 
cycles of synthesis or degradation will usually be applicable, 
in certain cases the technology will be workable using larger 
segments, e.g., dimers or trimers, in the cyclic reactions. 
The present invention also provides methods for 

25 production or selection of monomer-specific degradative 

reagents based upon catalytic antibody constructs. Antibody 
binding sites exhibiting specificity for binding particular 
terminal monomers can be linked to cleavage reagents or active 
sites of cleavage enzymes. Thus, reagents which are specific 

30 for particular terminal nucleotides may function to remove them 
in a specific fashion. 

The invention also makes use of a means for detecting 
or labeling the polymers. Particular sequencing chemistry can 
be selected for specificity in reacting with terminal monomer 

35 units. Alternatively, indirect labeling methods may be applied 
which can distinguish between different terminal monomers. 
Another alternative scheme allows for terminal labeling which 
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is nbt monomer-specific, but with the determination of the 
monomer based upon specificity of post-label reagents or upon 
monomer-distinguishable labels. Suitable such reagents will be 
antibodies or other reagents having specificity for 
5 distinguishing between different labeled terminal monomer 

residues and cleaving only those labeled monomer residues. 

Thus, although neither the reaction nor the label need 
necessarily be specific, at least one of the pair must be 
specific. This ensures that a comparison of label signal 
10 before and after a reaction allows determination of the change 
in label signal after monomer specific reactions are performed, 
and thereby providing the means to deduce the identity of the 
monomer at a given position. 

15 B- Substrate /Matrix 

The substrate or matrix has relatively fev constraints 
on its composition. Preferably, the matrix will be inert to 
the sequencing reactions to which the polymers attached thereto 
will be subjected. Typically, a silicon or glass substrate 

20 will be used, but other .suitable matrix materials include 
ceramics, or plastics, e.g., polycarbonate, polystyrene, - 
delrin, and cellulose, and any other matrix which satisfies 
these functional constraints. 

In one embodiment, the matrix should be sufficiently 

25 nondeformable that the scanning system can reproducibly scan 
the matrix and reliably correlate defined positions with 
earlier and later scan operations. However, by including 
alignment markings on the substrate, the need for absolute 
rigidity of the substrate may be reduced. 

3Q in an alternative embodiment, the matrix may merely be 

large enough that tixe attached polymer may be separated from a 
liquid phase containing the sequencing reagents. In this 
embodiment, a single detection unit is used to analyze the 
label in a multiplicity of different samples after each of the 

35 reaction steps. Thus, different samples may be separably 
treated in distinct wells of a micr titer dish. 
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Separate homogeneous polymers can be introduced to 
solid phase beads in each microti ter well. Sequencing reagents 
may be individually introduced separately into each well, or 
transferred from well to well with the polymers remaining in 
5 the correct well due to their solid phase attachments. 

In an alternative approach, the solid phase matrix may 
be marbles or other particularly shaped articles. Spherical 
shapes, solid or hollow, are preferred because they can be 
easily transported through troughs or tubing which retains 
10 their relative orders. By feeding a succession of beads 
through appropriate reaction baths and past a detector in a 
known and retained order, a succession of label detection 
results from a bead may be correlated and converted into a 
polymer sequence. 
15 The attachment of the target homogeneous clusters of 

target polymers to the substrate can be achieved by appropriate 

linkage chemistry. As indicated before, the linkage should be 
stable and* insensitive to the sequencing reagents used. The 
specific linkages will depend, of course, upon the particular 

20 combination of substrate and polymer being used. 

Typically, the most useful chemical moieties which 
will be used are amines. Typical substrate derivatized gorups 
include aminopropyl triethoxysilane, hydroxypropylacylate, or 
hydroxy reagents, see, e.g., D.S.S.N. 07/624,120* Typical 

25 polymer derivatized groups include nitroveratryl and 

nitroveratryl oxycarbonyl. Linkage types are also illustrated 
and detailed in U.S. S.N. 07/624,120, and U.S. S.N. 07/624,114. 

Fig. € illustrates one preferred linkage chemistry for 
nucleic acids. An NVO'-derivatized nucleotide is made as 

30 described in U.S. S.N. 07/624,120. The specific conditions for 
synthesis of thymidine are described therein and are adaptable 
to other nucleotides and nucleosides. The nucleoside analog is 
further derivatized with an appropriate R group at the 3 * 
hydroxy 1. Preferred R groups are indicated in Fig. 6. The 

35 linkage produces a photosensitive blocked nucleoside suitable 
f r phosphorcunidite synthesis of further polynucleotides which 
can serve as a complementary strand for hybridization of other 
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polymers.. The hybrids of the complementary strands may be 
covalently crosslinXed using acridine dyes or other 
intercalative reagents, e.g., psoralen. See, e.g., Komberg 
(1980) nwii T?r:^ pl icatlon Freeman, San Francisco; ffiesehahn, et 
5 al. (1978) PTftc^. Watl. Acad. Sci. USA 75:2705, and Sheldon 
(1986) U.S. Pat. NO. 4,582,789 which are each incorporated 

herein lay reference. 

The linJcage should be substantially inert to the, 
cyclic sequencing reactions and scan cycles. Usually, the 

10 linkage will be at a defined and homogeneous polymer position, 
preferably at the end opposite where the sequencing chemistry 
taJces place. Although the type of lin3tage is dependent upon 
the polymer being sequenced, various types of polymers have 
preferred linkages. For polypeptides, amino terminal or 

15 carboxyl terminal lintages will be preferred. Specific amino 
terminal linkages include amino butyric acid, amino caproic 

- acids, and simlar carboxylic acids. Specific carboxyl 

terminal linkages include butyric acid, caproic acid, and other 
carboxylic acids, hydrocaa^on , and ethers. See PCT publication 

20 nos. WO91/07087 and WD9 0/15070, which are incorporated herein 
by reference. For nucleic acids, the linkages will typically 
be either 5 » or 3* linkages. Suitable 3« linkages include 
those illustrated in Figure 6., and others described in U.S. S.N. 
07/624>114. 

25 Alternatively, for complementary polymers, 

particularly nucleic acids, linkage may be via crosslinkage of 
the con^lementary polymers where the complementary stand is 
directly attached to the matrix. Acridine dyes, e.g., 
psoralen, or a similar crosslinking agent between the strands 

30 can be used. See, e,g;, Dattagupta, et al., "Coupling of 
Nucleic Acids to Solid Support By Photochemical Methods," 
U.S. S.N- 4,713,326; and U.S. S.N. 4,542,102; and CSiattejee, M. 
et al. (1990) -T, am. ghem. Soc. 112:6397; which describ useful 
crosslinking reagents, and are hereby incorporated herein by 

35 reference. 

For polynucleotides, the preferred attachment to the 

matrix is through a synthetic oligomer by the 5' end of each 
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target sequence. This oligomer is designed to anneal to the 
desired target templates used in a synthetic system or to the 
polynucleotide used in the degradation approach. In one 
embodiment, a vector sequence which is complementary to the 
5 immobilized oligonucleotide is incorporated adjacent the 
cloning inserts, thereby providing a common complementary 
sequence for each insert. In particular, a cloning vector will 
be selected with a defined sequence adjacent the insert. See, 
e.g., Sambrook , (1989) Molecular Cloning: A Laboratory 

10 Manual , Vols, 1-3, Cold Spring Harbor Press, which is hereby 
incorporated herein by reference. This defined sequence is 
used, in some embodiments, as a common linker for all of the 
vector inserts. The inserts, adjacent to this linker, will be 
transferable by hybridization to the matrix linked 

15 complementary sequences. The hybrids are crosslinked by 
addition of a suitable crosslinker under appropriate 
conditions, for example^ photocros slinking by psoralen with uv 
light. See, e.g., Song et al. (1979) Photoche m. Photobiol. 
29:1177-1197; Cimino et al. (1985) Ann. Rev* Biochem. 54:1151- 

20 1193; and Parsons (1980) Photochem. Photobiol. 32:813-821; each 
of which is incorporated herein by reference. Using these 
approaches, the oligonucleotide linker serves as both the 
attachment linker and the polymerization primer. 

Figure 6 illustrates a preferred 3' terminal linkage 
25 designed for a phosphoramidite linkage of a synthetic primer 
and the reactions forming them. The chemical reactions for 
actually performing the linkage will be similar to those used 
for oligonucleotide synthesis instruments using phosphoramidite 
or similar chemistry- Applied Biosystems, Foster City, 
30 California supplies oligonucleotide synthesizers. 

C. Scanning System 

The scanning system should be able to reproducibly 
scan the substrate. Where appropriate, e.g., for a two 
35 dimensional svibstrate where the polymers are localized to 
positions thereon, the scanning system should positionally 
define the clusters attach d thereon to a reproducible 
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coordinate system. It is iiaportant that the positional 
identification of clusters be repeatable in successive scan 
steps. Functionally, the system should he able to define, 
physical positions to a coordinate system as described above 
5 and illustrated in Figs. 3 and 4. 

In alternative embodiments, the system can operate on 
a cruder level by separately detecting sepcirate wells on a 
microtiter plate, or by scanning marbles which pass by the 
detector in an embodiment as described above and illustrated in 

10 Fig. 4. ' 

The scanning system would be similar to those used in 
electrooptical scanning devices. See , e.g., the fluorescent 
detection device described in PCT publication no. WO90/15070, 
and U.S. S.N. 07/624,120. The system could exhibit many of the 

15 features of photographic scanners, digitizers or even compact 
disk reading devices. For example, a model no. PH500-A1 x-y 
translation table manufactured by Newport Corporation can be 
attached to a detector unit. The x-y translation table is 
connected to and controlled by an appropriately programmed 

20 digital computer such as an IBM PC/AT or AT compatible 
coB^JUter- The detection system can be a model no. R943-02 

photomultiplier tube manufactured by Hamama-tsu, attached to a 
preamplifier, e.g., a model no. SR4 40 manufactured by Stanford 
Besearch Systems, and to a photon counter, e.g. , an SR430 
25 manufactured by Stanford Research System, or a multichannel 
detection device. Although a digital signal may usually be 
preferred, there may be circumstances where analog signals 

would be advantageous. 

The stability and reproducibility of the positional 

30 localization in scanning will determine, to a large extent, the 
resolution for separating closely positioned polymer clusters 
in a 2 dimensional substrate embodiment. Since the successive 
monitoring at a given position depends upon the ability to map 
the results of a reaction cycle to its effect on a positionally 

35 mapped cluster of polymers, high resolution scanning is 

preferred. As the resolution increases, the upper limit to the 
number of possible p lymers which may be sequenced on a single 
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xna'trlx will also Incirease. Crude scanning syslieins nay resolve 
only on the order of 1000 ti, refined scanning systems may 
resolve on the order of 100 fi, more refined systems may resolve 
on the order of about 10 ^ with optical magnification systems a 
5 resolution on the order of 1.0 n is available, and more 

preferably a resolution on the order of better than 0,01 ^ is 
desired. The limitations on the resolution may be diffrication 
limited and advantages may arise from using shorter wavelength 
radiation for the photo-optical deprotection fluorescent 

10 scanning steps. However, .with increased resolution, the time 
required to fully scan a matrix will be increased and a 
compromise between speed and resolution will necessarily be 
selected. Parallel detection devices which will provide high 
resolution with shorter scan times will be applicable where 

15 multiple detectors will be moved in parallel. 

With other embodiments, resolution often is not so 
important and sensitivity might be emphasized. However, the 
reliability of a signal may be pre-selected by counting photons 
and continuing to count for a longer period at positions where 

20 intensity of signal is lower. Although this will decrease scan 
speed, it can increase reliability of the signal determination. 
Various signal detection and. processing algorithms may be 
incorporated into the detection system, such as described in 
U.S. S.N. 07/624,120. In one embodiment, the distribution of 

25 signal intensities of pixels across the region of signal are 
evaluated to determine whether the distribution of intensities 
corresponds to a time positive signal. 

The detection system for the signal or label will 
depend upon the label used, which may be defined by the 

30 chemistry available. For optical signals, a combination of an 
optical fiber or cheurged couple device (CCD) may be used in the 

detection step* In those circumstances where the matrix is 

itself transparent to the radiation used, it is possible to 
have an incident light beam pass through, the substrate with the 
35 detect r located opposite the substrate from the polymers. For 
electromagnetic labels, various forms of spectroscopy systems 
can be used. Various physical orientations for the detection 
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system are available and discussion of ojoportant design 
parameters is provided, e.g., in Jovin, Adv. in Biochem. 
Bioplyms, wliich is hereby incorporated herein by reference. 

Various labels which are easily detected include 
5 radioactive labels, heavy metals, optically detectable labels, 
spectroscopic labels and the like. Various photoluminescent 
labels include those described in U.S. S.N. 07/624,114. 
Protection and deprotection are described, e.g., in McCray, et 
al. (1989) 3^T>n. Rev. Bi ^rtiY^i^^l Chemistry 18:239-270, and 

ao tJ.S.S.N. 07/624,120, each of which is hereby incorporated 
herein by reference. 

With a processing system, the speed of scanning may be 
dramatically increased, with a system which only scans positions 
where taiown clusters of polymer are attached. This allows the 

15 scanning mechanism to s)cip over areas which have been 

determined to lack any polymer clusters and avoids loss of time 
in scanning useless regions of the matrix. Moreover, various 
problems with spurious or overlapping signals may be adjusted 
for by appropriate ansilysis. 

20 A scanning apparatus which may be used for the 

presently described uses is schematically illustrated in • 
Fig. 7. A substrate 52 is placed on an x-y translation 
table 54. In a preferred embodiment the x-y translation table 
is a model no. EM500-A1 manufactured by Newport Corporation. 

25 The x-y translation table is connected to and controlled by an 
appropriately programmed digital computer 56 which may be, for 
example, an appropriately programmed IBM PC/AT or AT compatible 
computer, of course, other computer systems, special purpose 

hardware, or the like could readily be substituted for the AT 

30 con^uter used herein for illustration. Coa^uter software for 
the translation and data collection functions described herein 
can be provided based on commercially available software 
including, for example, "Lab Windows" licensed by National 
instruments, which is incorporated herein by reference for all 

35 purposes. 

The substrate and x-y translation table are placed 
under a microscope 58 which includes one or more objectives 60. 
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Light (about 488 nm) from a laser 62, which in some embodiments 
is a model no. 2020-05 argon ion laser manufactured by 
Spectraphysics, is directed at the substrate by a dichroic 
mizTor 64 which passes greater than about 520 ran wavelength 
5 light but reflects 488 nm light. Dichroic mirror 64 may be, 
for example, a model no. FT510 aemufactured by Carl Zeiss, 
Light reflected from the mirror then enters the microscope 58 
which may be, for example, a model no, Axioscop 20 manufactured 
by Carl Zeiss. Fluorescein-mar3ced materials on the substrate 

10 will fluoresce >488 nm light, and the fluoresced light will be 
collected by the microscope and passed through the mirror- The 
fluorescent light from the substrate is then directed through a 
wavelength filter 66 and, thereafter through an aperture plate 
€8. Wavelength filter 66 may be, for example, .a model no. 

15 OG530 ffiemufactured by Melles Griot and aperture plate 68 may 
be, for example, a model no. 477352/477380 manufactured by Carl 

Zeiss. 

The fluoresced light then enters a photomultiplier 
tube 70 which in one embodiment is a model no. R943-02 

20 manufactured by Hamamatsu, the signal is amplified in 

preamplifier 72 and photons are counted by photon counter. 74. 
The number of photons is recorded as a fiinction of the location 
in the computer 56. Pre-Amp 72 may be, for example, a model 
no. SR440 manufactured by Stamford Research Systems and photon 

25 counter 74 may be a model no. SR430 manufactured by Stanford 
Research Systems. The substrate is then moved to a subsequent 
location and the process is repeated. In preferred embodiments 
the data are acquired every 1 to 100 ^m with a data collection 
diameter of about 0.8 to 10 ^m preferred. In embodiments with 

30 sufficiently high fluorescence, a CCO detector with broadf ield 
illumination is utilized. 

By counting the number of photons generated in a given 

sirea in response to the laser, it is possible to determine 
where fluorescent marked molecules are located on the 
35 substrate. Consequently, for a substrate which has a matrix of 
polypeptides, for example, synthesized on the surface thereof. 
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it ife possible to determine which of the polypeptides has 
incorporated a fluorescently marked monomer. 

According to preferred embodiments, the intensity and 
duration of the light applied to the substrate is controlled by 
5 varying the laser power and scan stage rate for improved 

signal-to-noise ratio by maximizing, fluorescence emission and 
minimizing background noise. Signal analysis may improve the 
resolution and reliability of the system. The time of photon 
counting may be varied at various positions to provide high 
10 signal to background or noise, 

D. Svnthetin or- Deor aHative Cvcle 

The. present invention provides a substrate with 
positionally separated polymers for sequencing. The separation 

15 may be by solid phase carriers separated in separate wells, by 
separately manipulable carriers such as beads or marbles, or by 
physical separation ot regions on a two-dimensional substrate 
surface. Each cluster region is a target for the sequencing 
reactions. Although the reactions are, in various embodiments, 

20 performed on all the clusters together, each cluster can be 
individually analyzed by following the results from the 
segu^e of reactions on polymer clusters at positionally 

defined locations. 

The synthetic mode, as illustrated in Fig. 1 is easily 

25 applied to the sequencing of nucleic acids, since one target 
strand may serve as the template to synthesize .the 
co3i5>lementary strand. The nucleic acid can be DNA, RKA or 
mixed polymers. For the purposes of illustration, and not by 
limitation, the sequencing steps for DNA are described in 

30 detail. The synthetic mode, an example a£ ^ich is depicted in 
Fig. 8 for nucleotides, may also be useful in circumstances 
where synthesis occurs in response to a known polymer sequence. 
The synthetic scheme depends, in part, on th stepwise 
elongation by small and identifiable units, A polymerase is 

35 used to extend a primer complementary to a target template. 
The primer is elongated one nucleotide at a time by use of a 
particular modified nucleotide analog to which a blocking agent 
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is added and which prevents further elongation. This blocking 
agent is analogous to the dideoxy nucleotides used in the 
Sanger and Coulson sequencing procedure, but in certain 
embodiments here, the blockage is reversible. This analog is 
5 also labeled with a removable moiety, e.g., a fluorescent 
label, so that the scanning system can detect the particular 
nucleotide incorporated after its addition to the 
polymerization primer. 

Pcuiel 4A illustrates the cycle of sequence reactions 

10 in one embodiment. The template polymer 82 located at a 
particular site has already been linked to substrate. The 
template 82 and complementary primer 84 are hybridized. Often, 
the primer 84 is common to all of the target template 
sequences, selected by its common occurrence on a selected 

15 cloning vector. The primer 84 is also often covalently 

crosslinked to the target template 82 using psoralen and U.V. 
light. 

— ' Labeled and blocked monomers 86 are shown, the leU^el 
depicted by the asterisk and the polymerization blocking groups 

20 indicated by B. • A compatible polymerase 88 which can elongate 
the primer with the labeled blocked monomers 86 is used in 
reaction 1. In the preferred embodiment, the separate labeled 
monomers can be distinguished from one another by the 
wavelength of fluorescent emission. 

25 In the example illustrated, a labeled blocked 

m 

guanosine monomer has been incorporated into the elongated 
primer 90. 

step 2 is a scan, where the signal at the position 
corresponding to template 82 indicates that the guanosine 

30 analog was incorporated. Reaction 2 is performed, a removal of 
both the label emd blocking group. It will be recognized that 
the blocking group prevents elongation by any more than a 
single nucleotide in each reaction cycle. Reaction 3 is 
equivalent to reaction 1, though the siibstrate primer has been 

35 elongated by one monomer. 

Panel B illustrates the scheme in a logic flow chart. 
The template 82 is attached to the stibstrate, either directly 
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or througli the primer. Reaction 1 elongates the primer by a 
single labeled blocked nucleotide. A scan step is performed 
and the blocking and labeling agents are removed, "nhe 
elongation reaction is performed and the cycle repeated. 
5 For a nucleic acid, a unit for addition would 

typically be a single nucleotide. Under certain circumstances, 
dimers or trimers or leirger segments may be utilized, but a 
larger number of different possible nucleotide elements 
requires high distinguishability in other steps. For example, 

10 there are only four different nucleotide monomer possibilities, 
but there are sixteen different dimer possibilities. The 
distinction among four possibilities is more precise and simple 
than among sixteen dimer possibilities . To prevent elongation 
by a unit length greater than one monomer, the nucleotide 

15 should be blocked at the position of 3' elongation. Usually, 
the nucleotide will be blocked at the 3 • hydroxyl group where 
successive nucleotides would be attached. In contrast to a 
dideoxy nucleotide, typically the blocking agent will be a 
reversible blocking agent thereby allowing for deblocking and 

20 sxibsequent elongation. 

Variations may be ieasily incorporated into the. - 
procedure. If the labels on the monomers are not 
distinguishable, successive substrate scans can be performed 
gft^j. each monomer is provided conditions allowing its 
25 incorporation. Alternatively, a small fraction of permanently 
blocked but reversibly labeled monomers may be incorporated. 
Those specif ic molecules which incorporate the blocked monomers 
are permanently removed from further .polymerization, but such 
is acceptable If- the labeling moiety is also removed. 

30 

1, tot her monomers 

one important functional property of the monomers is 
that the label be removable. The removal reaction will 
preferably be achieved using mild conditions. Blocking groups 
35 sensitive to mild acidic conditions, mild basic conditions, or 
light are preferred. The label position may be anywhere on the 
molecule compatible with appropriate polymerization, i.e., 
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complementary to the template, by the selected polymerase. A 
single polymerase for all of the modified nucleotide is 
preferred, but a different polymerase for each of the different 

jDonomers can be used. 
5 Nucleotide analogs used as chain-terminating reagents 

will typically have both a labeling moiety and a blocking agent 
while remaining compatible with the elongation enzymology. As 
the blocking agent will usually be on the 3 * hydroxyl position 
of the sugar on a nucleotide, it would be most convenient to 

10 incorporate the label and the blocJcing agent at the same site, 
providing for a single reaction for simultaneous removal of the 
label and blocking agent. However, it is also possible to put 
a label on another portion of the nucleotide analog than the 3 • 
hydroxyl position of the sugar, thereby requiring a two-step 

15 reaction cycle for removing the blocking and labeling groups. 

Analogs will be found by selecting for suitable 
combinations of appropriate nucleotides with compatible 
polymerases. In particular, it is desired that a selected 
polymerase be capable of incorporating a nucleotide, with 

20 selectivity, having both the blocking moiety and the label 

moiety attached. It has been observed that KNA polymerases are 
less fastidious with respect to the nucleotide analogues which 
will be polymerized into a growing chain. See, e.g. , 
Rozovaskaya, T., et al. (1977) Wolekulvamava Bjolocriya, 

25 ll:598-610r Kutateladze, T., et al. (1986) IloleJailyaimCa 

Rioloatva . 20:267-276; smd Chidgeavadze, 2., et al. (1985) F E BS 
Letters . 183:275-278. Moreover, those references also indicate 

that rather significant chemical moieties may be attached at 
the 2V or 3' positions on a nucleotide, and still be correctly 
30 incorporated at the growing chain terminus. 

In particular, it is not necessary that the same 

nucleotide have, both the reversible blocking moiety and the 
removable labeling moiety, as a combination of two separate 
nucleotide analogues could be utilized, e.g., Nl, which is ^ 
35 reversibly blocked and not labeled, and N2, which is 

irreversibly blocked but removably labeled. Note that the 
removal of label may be affected by destruction of the label. 
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e.g.*, fluorescence destruction, or preferably by removal. Both 
of these nucleotides might be, for instance, A emalogues. With 
the mixture, at an appropriate sequence position of a target 
sequence, an Nl and N2 nucleotides can be incorporated at an 

5 appropriate ratio, and these can be polymerized by either two 
separate polymerases, or preferably a single polymerase. 

For example, two separate polymerases might be 
necessary, PI which incorporates Kl, and P2 which incorporates 
N2. At the given location in the sequence, some of the growing 

10 polymers will incorporate Nl with PI polymearase, and others 

will incorporate N2 with the P2 polymerase. The proportions of 
Nl, N2, PI, and P2 may be titrated to get the desired 
fractional proportions of the Nl .reversibly blocked nucleotides 
and the N2 labeled but irreversibly blocked nucleotides. 

15 As all of the growing chains have blocked nucleotides, 

no elongation takes place beyond a single nucleotide. The N2 
nucleotides provide a specific label, detected in the scanning 
step. After determination of the incorporated label, the label 
may be removed or destroyed, and those irreversibly terminated 

20 growing chains become permamently removed from further . 

participation in the sequencing process. Photodestruction may 
be achieved by a high intensity laser beam of the correct 
wavelength. See, e.g., March (1977) Advanced Organic 
Chemistry; Reactions. Mechanisms and Structure (2d Ed) McGraw; 

25 and Carey and Sundberg (1980) Advanced Organic Chemistry; part 

A structure and Mechanisms . Plenum. 

Next, the reversible blocking moiety is removed, 
providing a new set of slightly longer polymers ready for the 
next step. Of course, the amount of label necessary to be 

30 incorporated must be detectable, preferably with a clear, 

unambiguous positive signal. The amount of label dLncorporated 
will depend, in part, upon the conditions in the pblymizing 
step and the relative incorp ration of the Nl and N2 
nucleotides. The proportions of the nucleotides, polymerases, 

35 and other reagents may be adjusted to appropriately incorporate 
the desired proportions of the nucleotides. 
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In an embodiment where a single polymerase will 
incorporate both Nl and N2, the relative proportions and 
conditions to get the correct incorporation levels of the two 
nucleotides can be titrated. In an alternative preferred 
5 embodiment, a single nucleotide will have both the removable 
label and the reversible blocking moiety. 

A similar approach may be necessary where only some 
fraction of the nucleotide zmalogues is labeled. Separate 
polymerases might also be useful for such situations, and each 

10 polymerase may have special conditions necessary for activity* 

Procedxires for selecting suiteible nucleotide and 
polymerase combinations will be readily adapted rom Ruth et al. 
(1981) Molecular Pharmacology 20:415-422; Kutateladze, T. , et 
al. (984) Nuc . Acids Res . . 12:1671-1686; Kutateladze, T., et 

15 al. (1986) Hole}ailvamava Bioloaiva 20:267-276; Chidgeavadze, 
2., et al. (1985) FEB5 Letters . 183:275-278; and Rozovskaya, 
T. , et al. (1977) Molelculvamaya Bioloaiva 11:598-610., _ 

The determination of termination activity is done in 
■two steps. First, nucleotide analogues are screened for the 

20 ability of the compound to inhibit polymerase activity. Then 
the nucleotide analogue is tested for base-specific termination 
as manifested by generating a correct DNA sequencing ladder on 
a template of ]cnown sequence.. The appropriate reaction 
conditions are those used for conventional sequencing reactions 

25 with the respective polymerases. The conditions cure then 

modified in the usual ways to obtain the optimal conditions for 
the particular terminator compovmd (e.g. concentration of 
terminator, ratio of terminator to dNTP, Mgtt, and other 
reagents critical to proper polymerase function. 

30 By way of example, an approach employing the 

polymerase known as reverse transcriptase (AK7) will be 
described. The initial conditions are essentially as described 
by Prober, et al. (1987) Science 238: 336-341. 

A nucleotide analogue is first selected from the group 

35 available from a commercial source such as Amersheun, New 
England Nuclear, or Sigma Chemical Company. In particiilar. 
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iwcleotides which are reversibly blocked from further 
elongation, especially at the 5- or 3' -OH will be used. 

General properties which are desired have been 
described. _Eaeh of these analogs can be tested for 
5 compatibility With a particular polymerase by testing whether 
such polymerase is capable of incorporating the labeled analog, 
various polymerases may be screened, either natural fonas of 
the mentioned types, or variants thereof. Polymerases useful 
in connection with the invention include E. Coll DHA polymerase 

10 (Klenow fragment) ; are Klenow and Henningsen (1970) T>P?P- Haf 1 
^nart sel PSA 65:168; and Jaoobsen et al. (1974) Jtw i r, J- 
SiaSiSSE. 45:623? modified and cloned versions of T7 DNA 
polymerase (Sequenase and Seguenase 2.0); see Tabor and 
Richardson (1987) yrnr N.tM ftr.d. sci. USA 84:4767; and Tabor 

15 and Richardson (1987) .T. Biol. Chem. 262:15330; Tag DNA 

polymerase from thermostable Thermus agaaticus; see Chxen et 
al (1976) .T- Bacterol . 127:1550; and its cloned version 
toplitag; Saiki and Gelfand (1989) aPpH f i ?^ttons 1:4; T4 DHA 
polymerase; see Nossal (1974) .t Biol- Chem. 249:5668, and 

20 various reverse transcriptases, both RNX- and DMA- dependent 
Dsa polymerases, e.g., avian retroviruses; see Houts (1970) 1. 
virology 29:517; and murine retroviruses; see Kotewica et al. 
(1985) Ssne 85:249; Gerard et al. (1986) Sm 5:271; and Bst 
polymerase, see Ye, S. and Hong (1987) pri>.ntj.a Slnica 30:503- 

25 506. , ...^ • 

in order to ensure that only a single nucleotide is 

added at a time, a blocking agent is usually incorporated onto 

the 3- hydroxyl group of the nucleotide. Optimally, the 

blocking agent should be removable under mild conditions (e.g., 

30 photosensitive, weak acid labile, or weaJc base labile groups), 
thereby allowing for further elongation of the prW strand 
with a next synthetic cycle. Xf the blocking agent also 
contains tlie fluorescent label, the dual blocking and labeling 
functions will be achieved without the need for separate 

35 reactions for the separate moieties. 

The blocking group should have the functional 
properties of blocking further elongation of the polymer. 
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Additional desired properties are reversibility emd inertness 
to the sequencing reactions. Preferably, where an enzymatic 
elongation step is used, the monomers should be compatible with 
the selected polymerase. Specific examples for blocking groups 
5 for the nucleic acids include acid or base labile groups at the 
3* OH position. See, e.g.. Gait (1&84V Oligonucleotide 

I 

Synthesis: A Practical Approach . IHL Press, Oxford. 

A DNA-dependent DMA polymerase is the polymerases of 
choice. Polymerases used for conventional DNA sequencing, for 

10 example, Klenow fragment of E.coli DNA Pol, Seguenase (modified 
T7 DNA polymerase). Tag f Thermus aouaticus ^ DNA polymerase. Est 
fBacillus stearot hermophilus^ . DNA polymerase, reverse 
transcriptase (from AMV, MHLV, RSV, etc.) or other DNA 
polymerases will be the polymerases of choice. However, there 

15 is a functional constraint that the polymerase be compatible 
with the monomer analogues selected. Screening will be 

performed to detexmine appropriate polymerase and monomer 
analog combinations. 

Removal of the blocking groups may also be xinnecessary 

20 if the labels are reniovable. In this approach, the chains 
incorporating the blocked monomers are permanently terminated 
and will no longer participate in the elongation processes. So 
long as these blocked monomers are also removed from the 
labeling process, a small percentage of permanent loss in each 

25 cycle can also be tolerated. 

The fluorescent leOsel may be selected from ojiy of a 
number of different moieties. The preferred moiety will be a 
fluorescent group for which detection is quite sensitive. 
Various different fluorescence-labeling techniques are 

30 described, for example, in Gambara et al . (1988) "Optimization 
of Peurameters in a DNA Sequenator Using Fluorescence 
Detection," Bio/Technol . 6;S16-821; Smith et al . (1985) Nucl. 
Acids Res. 13:2399-2412; 2md Smith et al . (1986) Nature 
321:674-679, each of which is hereby incorporated herein by 

35 reference. Fluorescent labels exhibiting particularly high 
coefficients of destruction may also be useful in destroying 
nonspecific background signals. 
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Appropriate blocking agents include, aBong others, 
Ught sensitive groups such as 6-nitoveratryl-o^bonyl 
(NVOC), 2-nitobenzyloxycarhonyl (NBOC), . . ^_ 

SethUenzyloxycarbonyl (DD.) , .5-broMo-7-n.tro.ndolx„yl^o- 
S ZroJ-.-^^n cinna^oyl, 3-oxy.ethylene anthra<^.none, a«i 
t-butyl oxycarbonyl (TBOC). other blocking reagents are 

t-fiUtyi oxyt.ai^ I .„.<.4„„ „„ WO90/15070; Patchomilc 

discussed, e.g., in PCT publication no. wo90/i5 , 

^ incorporated herein 

10 by reference. Additional blocking agents attached to 

particular positions way be selected according to the 

functional directives provided herein. 

Pig. 9 schematically Ulustrates the synthesis of a 

generic protected nucleotide. A suitable nucleotide is labeled 
,3 ::rr ^00 ^ently detectab. 

the conditions described, e.g., xn O.S.S.N. 07/ . 

(TMS-Cl) , I«0C-C1, and H,0. A protection BOiety will be 

using conditions also described there. 

various nucleotides possessing features useful in the 
20 described x^thod can be readily synthesized. 

are attached at. appropriate sites on the n-^-f^^^^'^^^^, 

...^.try and conditions as ^.tlir led 

a. ri:SeTalso outlines various reactions «hic. lead to useful 
""^'^'''''^tionally, the selected polymerases used i. 

— t^dt: pt: Lrtr^^^^^^ 
30 :::e^rs:re:u::s L^udeotide - p--^— 

be devised to -^J^t^te^i r ^l^rdirec^ 

functional. A test using primer witn rei^ „„^„„ted will 
tbe addition of the nucleotide analog *° ''"^^ 

defined conditions. 
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The degradative scheme is generally lllust:rated in 
Fig. 1, an example more generally applicable to biological 
macronolecular polymers is depicted in Fig. 10. This method is 
xiseful for a wider variety of polymers without the limitations 
5 imposed by the need to replicate the polymer. The degradative 
sequencing technique depends, in part, upon the aJDility to 
specifically label or distinguish between various different 
terminal monomers at particuleu: matrix positions. Reactions 
for specific removal of a defined monomer unit are important. 

10 This monomer distinguishability can arise from an 

ability to differentiate between label on the various possible 
monomers in the polymer. As a second means, distinguishability 
can come from specific reagents which react with particularity 
on different monomers. Thus, for instance, labels may be used 

15 which generally attach to the terminal nucleotide, but whose 
fluorescent signal differs depending upon the nucleotide. As a 
third means, a reagent which specifically affects the label on 
only one monomer may be used, as described below. 

In the first example, every polymer cluster will be 

20 labeled at a particular end, e.g., the 5* end, without 

specificity for the monomer located there. The scan step. will 
be able to distinguish the terminal monomers, after which each 
labeled terminal monomer is specifically removed. The general 

label step is repeated in the cycle as described. 

25 In the second meeuis for distinguisheO^ility, reagents 

are used which produce a signal which is dependent upon the 
terminal nucleotide. For example, a labeling molecule which 
binds only to one specific terminal monomer will provide a 
monomer specific label. . This will provide a cycle much liJce 

30 the first means for distinguishability where the properties of 
the label is different depending upon the terminal nucleotide 
to which each specific labeling reagent binds. 

In the third means for distinguishaibility, an 
individual reagent labels or affects only a specific terminal 

35 monomer. Polymers susceptible to each reagent by virtue of 
terminating with the corresponding monomer will have its label 
specifically affected. A scam of the matrix after each step 
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• and comparison with the earlier scans will determine which 
positions correspond to polymers ending with a susceptible 
monomer- Performing a removal step with a second monomer- 
specific reagent followed by a scan will identify those 
5 positional locations having polymer clusters ending with that 
second monomer. A similar reagent for the other possible 
monomers will further define all of the possibilities. 
Finally, when all of the possible monomers have been removed, 
the labeling reaction may be repeated and the succession of 
10 specific reagent and scanning steps will also be repeated. 
This procedure allows for a succession of automated steps to 
determine the sequence of the polymer, clusters localized to 

distinct positions. 

Finally, a combination of both specificity of reagent 

15 and ability to distinguish label on different monomers can be 
utilized. Neither alone need be relied upon exclusively. 
•Ehus, in the case of nucleotides, an ability to distinguish 
into two separate classes of nucleotides, e.g., A and C from G 
and T, combined with specif ic reagents for distinguishing 

20 between the indistinguishable label pairs, e.g., in the example 

provided, A from C, or G from T, can also provide sufficient 
information for sequencing. 

instead of performing four specific reactions on the 
same substrate natrix, each of the four individual reactions 

25 cau be performed on separate parallel matrices. Four separate 
substrate matrices may be made by a replica plating or 
successive transfers, each matrix having the same spatial 
distribution of polymer clusters. Thereby, each separate 
substrate can be subjected to only a single specific reagent an 

30 a Highly optimized reaction. On each cycle, one out of the 
four parallel substrates should show a signal indicating the 
monomer at the terminal ' for the cluster, at a given matrix 

position. ^ 

Ijjcewise, two parallel substrates can be provided, and 

" 35 each of the parallel substrates is used to determine two of the 
four possible nucleotides at each position. Instead of 
treating a single matrix vith four separate reactions, this 
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approach allows treating each of two substrates with only two 
separate reactions. By kinimizing the niamber of reactions to 
which each chip is exposed, the side reactions will be 
minimized, the chemistry will be optimized, and the number of 
5 cycles through which a matrix will survive will be optimized. 
This provides an advantage in the number of cycles to which a 
matrix can be subjected before the signal to noise becomes 
indistinguishable, 

10 E- Label 

The label is important in providing a detectable 
signal. The signal may be distinguishable eunong the various 
monomers by the nature of the signal, e.g., wavelength or other 
characteristic, as described in Prober et al. (1987) Science 

15 238:336-311. A monomer-specific reagent can allow 

determination of whether each position has a par±icular 
terminal monomer by the presence or loss of label. 

The label on the monomer may be a noncovalent 
attachment, but will be preferably a direct covalent 

20 attachment. The label will typically be one which is capable 
of high positional resolution and not interfere with the 
nucleotide-specific chemistry or enzymology. Although many 
different leibels may be devised including enzyme linJeed 

immunosorbent assays (ELISA) , spectrophotometric labels, light 
25 producing or other labels, a fluorescent moiety is the 

preferred form. For example, an avidin/biotin type affinity 

binding may be useful for attaching a particular label. 

Alternatively, an antibody may be used which is specific for 

binding to a psurticular terminal monomer. A wide variety of 
30 other specific reagents can be used to provide a labeling 

function. See , for example, U.S. S.N. 07/624,114. 

The means of detection utilized will be selected in 

combination with various other considerations. In some 

circumstances, a spectroscopic label may be most compatible 
35 with a particular monomer. Enzyme linked assays with a 

spectrophotometric detection system are a worlcable system. 

Ph sphorescent or light producing assays provide high 
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'sensitivity using charged couple devices. Fluorescent systems 
provide the same advantages, especially where the incident 
light beam is a laser. The fluorescent label also may provide 
the added advantage of fluorescing at different wavelengths for 
5 the different monomers, providing a convenient means to 

distinguish between different monomers. Other forms of label 
may be desired for various reasons, for example, magnetic 
labels, radioactive labels, heavy metal atoms, optically 
detectable labels, spectroscopically detectable labels, 
10 fluorescent labels, and magnetic labels. 

For sequencing nucleic acids by this method, the 
labeled monoBers are simpler thcin those monomers used for the 
synthetic method. The blocking group is unnecessary, but 
terminal specific reagents are more difficull: to produce, 
15 The. preferred attachment sites will be at the. same 

location as the blocking site, so a combined leibel and blocking 
moiety is more preferred. The label will be attached as 
described, e.g., in U.S. S.N. 07/624,114. 

Two types of degradation cycles can be used, either 
20 non-specific removal of the terminal leibeled nucleotide, or a 
base-specific removal. With the non-specific removal- means, 
each of the end monomers, when labeled, shoulii be 
~ distinguishable from the other three monomer possibilities . 
This allows for determination of the terminal nucleotide for 
25 the cluster localized at a given matrix- position. Then the 
terminal, labeled nucleotides are non-specif ically removed and 
the newly exposed terminal nucleotides will be again 

distinguishably labeled. 

By this scheme, a specific label for each of the 

30 different nucleotides may be provided. For example, 

fluorescent reagents specific for each of the nucleotides may 
provide a signal with a different wavelength. This will more 
usually occur when the fluor scent probe is located near the 
base m iety of the nucleotide In the scanning step, the 

35 regions terminating with each of the four different nucleotides 
may be determined. Then, . a reaction is performed removing the 
labeled terminal nucleotides from all of the polymers. This 
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removal may be either enzymatic, using a phosphataBe, an 
exonuclease or other similar enzyme, or chemical, using acid, 
base, or some other, preferably mild, reagent. Again, the 
reactions are performed which label each of the terminal 
5 nucleotides and a scan step repeated in the same manner. In 
the base-specific removal scheme, nucleotide-specif ic removal 
can be performed. For example, an enzyme which will function 
to remove only a single modified nucleotide, e.g., a 5*-, 
fluorescein-dAMP-specif ic exonuclease, is constructed. This 

10 may be achieved by proper construction of a catalytic antibody. 
Other similar reagents may be generated for each of the other 
labeled nucleotide monomers. 

Catalytic or derivatized antibodies to catalyze the 
removal of the 3»-end or 5* -most fluorescent base in a base- 

15 specific manner may be constructed as follows. A recombinant 
antibody library or a series of monoclonal antibodies is 
screened with fluorescent donor-quencher substrates. These 
substrates consist of a fluorescent labeled base (At'C, g, or 
T) on the 5* or 3' end joined by a 5* to 3' phosphodi ester 

20 linkage to a second base. A collection of all four possible 
second bases for each of the four end bases gives the best 
selection target for the required non-specificity with respect 
to the second base. The second base is then tethered to an 
acceptor group in sufficient proximity to quench the 

25 fluorescence of the end group. In the presence of a catalytic 
antibody with cleaving activity, a fluorescent signal occtirs 
from the separation of the quenching group from the terminal 
fluorescent label. To assure both base and end specificity, 
the positive monoclonal antibody clones are rescreened against 

30 the other substrates. 

Upon selection of an antibody exliibiting the desired 
specificity (or lack thereof) , the reactive group for cleavage 
may be attached. This cleavage reagent may be chemical or 
enzymatic and will be attached by an appropriate length linker 

35 to the antibody binding site in an orientation ^rtiich is 

consistent with the steric requirements of both binding eind 
specific cleavage. 
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Particularly useful specific reagents may be produced 
Bia3cing antibodies specific for each of the four different 
laodified terminal nucleotide .bases* These iantibodies would 
then specifically bind only to polymers terminating in the 
5 appropriate base analog. By combining a cleavage reagent to. 

the specific antibody, a terminal nucleotide specific cleavage 
" reagent is generated. 

In one exzunple of the degradative embodiment, all of 
the polymers may be uniformly labeled at a particular end* 
10 Thereafter, a specific removal reaction which removes only a 
particular nucleotide may be performed, leaving the three other 
nucleotides labeled. Thereafter, a scanning step is performed 
through which all regions which had incorporated that 
particular nucleotide will have lost the label through specific 
15 removal. Then, the second specific reagent will b© applied 
which specifically removes the second labeled nucleotide, and 
the scanning step following that reaction will allow 
determination of all regions irtiich lose the second particulsur 
nucleotide. This process is repeated with reagents specific 
20 for each of the last two remaining labeled nucleotides 

interspersed with, scanning steps, thereby providing information 
on regions with each of the nucleotides located there. Then, 
the entire process may be repeated by labeling the next 
terminal nucleotides uniformly. As mentioned below, 

25 replication techniques may allow for maJcing f our separate but 
identicEuL matrix substrates. Each substrate may be subjected 
to single nucleotide-specific reactions, and the scan results 
correlated with each of the other parallel substrates. 

In the degradation scheme, the polynucleotide linlcage 

30 to the matrix must be more carefully selected such that the 
free end of the oligonucleotide segments used for atta chm e n t 
will not interfere with the determinations of the target 
s quence terminus. 

35 F. Utility 

The present sequencing method is useful to monitor and 
check the accuracy and reliability of the synthetic pr cesses 
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as described in PCT piiblication no. W090/15070. The present 
method can be used to check the final products synthesized 
therein, or to label each monomer as they are added stepwise to 
monitor the efficiency and accuracy of those synthetic methods. 
5 The present invention can also be used to monitor or 

sequence matrix bound clusters of positionally distinct 
polymers. This sequencing process provides the capability of 
simultaneously sequencing a large plurality of distinct 

polymers which are positionally segregated. 

method will be used to sequence extremely large 
stretches of polymer, e.g., nucleic acids. A large number of 
shorter segments of a large sequence can be sequenced with 
alignment of overlaps either randomly generated, or in an 
ordered fashion, or particular sequenceable segments of a large 
15 segment can be generated. In one approach, a large segment is 
subcloned into smaller segments and a sufficient number of the 
randomly generated subclones are sequenced as described herein 
to provide sequence overlap and ordering of fragments. 

In an alternative approach, a large segment can be 
20 successively digested to generate a succession of smaller sized 
subclones with ends separated by defined numbers of monomers. 
The subclones can be size sorted by a standard separation 
procedure and the individual samples from a separation device 
manually or automatically linked to a matrix in a defined 
25 positional map. Fractions resulting from size separation can 
be spatially attached at defined positions, often at adjacent 
positions. Then polymer sequences at adjacent positions on the 
matrix will also be known to have ends which differ by, e.g., 
approximately 25 or 50 or more monomers, thereby providing 
30 significantly greater confidence' in overlapping sequence data. 

III. Specif |i?^hft^^p.opt-e 

A specific series of reactions for sequencing a matrix 
of polynucleotides is described. 
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*A, syp-b^etic Method 

This method involves annealing a primer (common to all 
the attached sequences by virtue of the cloning construction) 
near to the 3' end of the unknown target sequences. DMA 
5 polymerase, or a similar polymerase, is used to extend the 
chains by one base by incubation in the presence of dNTP 
analogs which function as both chain terminators and 
fluorescent labels. This is done in a one-step process where 
each of the four dNTP analogs is identified by a distinct dye, 
10 such as described in Prober et al . Science 238:336-341, or in 
four steps, each time adding one of the four bases, 
inteirspersed with a scanning identification step- When each 

cluster incorporates the proper one of the four bases and the 
fluorescence scanning is complete, the matrix is stripped of 
15 the label and the chain terminators are deblocked, for a next 
round of base addition. Because the base addition is directed 
by the template strand, the complementary sequence of the 
fragments at each address of the matrix is deduced- 

20 (1) Attachment to a surface. 

Both degraditive and synthetic sequencing methods 
begin by obtaining and immobilizing the target fragments of 
unknown sequence to be determined at specific locations on the 
surface. 

25 There are several strategies for photo-directed 

attachment of the DNA strands to the surface in an orientation 
appropriate for sequencing. A caged biotin technique, see, 
e.g., POT publication no. WO91/07087, is available. Another 
technique that is especially applicable for the enzymatic 

30 synthesis method is to chemically attach a synthetic oligomer 
by the 5* end to the entire surface (see Fig. 6) , to activate 
it for photocrosslinking (with psoralen, for example) and to 
anneal the complementary strands and photocrosslink the target 
strand of unknown sequence (complementary to this 

35 oligonucleotide at tiie 3 • end) at the specific location 

addressed by light. In this case, the oligonucleotide s^es 

as both the attachment linker and as the synthetic primer- A 
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third method is to physically transfer individual nucleic acid 
samples to selected positions on the matrix, either msuiually or 
automatically - 

Many sequences in each step are attached by cloning 
5 the library into a series of vectors identical except for the 
5ec[uences flanking the insert.. These primers can be added at 
the pcint of amplification of the cloned DNA with chimeric 
primers, . . 

Alternatively, sequences are attached to a matrix 

10 substrate by colony or phage immobilization. This directly 
transfers the positional distribution on a petri plate to a 
losable substrate. Colonies representing a shotgun collection 
of sequences (enough to assure nearly complete coverage by 
overlap) are spread over (or in) a nutrient surface at a 

15 density to give about 100 or more colonies or plaques in 
several square centimeters, and the colonies are allowed to 

■ I 

grow to about 0.1 mm in diameter (the maximum possible density 
of clusters at this size is -10,000 colonies/cm^). As 
described above, replica platings or successive transfers may 
20 allow for preparation of multiple matrices with identical 

positional distributions of polymers. £ach sepsurate matrix may 
then be dedicated to the reactions applicable to a single 
monomer . 

For example, in the use of a phage library, on a petri 
25 dish, the transfer substrate surface is treated to release DKA 
from the phage. This is done, e.g., with CHCI3 vapor, SDS- 

NaOH, or by heating. Prior to release of DNA, the phage 
particles are often adsorbed to the surface by way of em 
euitibody to the coat protein that has been immobilized on the 
30 surface. This strategy prevents diffusion of the phage from 
the colonies. The matrix surface is prepared by coating with 
an oligonucleotide, immobilized to the surface by one end that 
has homology with the phage vector DNA adjacent to the cloning 
site. 

35 The matrix surface is juxtaposed to the growth 

surface, £md the phage DHA is allowed to anneal to the 
immobilized oligonucleotide* The growth surface is removed. 
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and the hybrid is stabilized by psoralen or an equivalent 
crosslinlcing reagent. 

This method provides an efficient one-step method of 
placing many DNA fragments onto the detection surface in 
5 preparation for sequencing. Although the colonies are not 
placed in predefined locations, the random arrangement of the 
clusters allows the final sequence to be assembled from 
correlation of overlap sequence data derived from sequence data 
derived from each of the defined positions of each target 
10 cluster. 

Sequences are, in other embodiments', attached by a 
manual or automated transfer technique. A few cells from each 
colony in a library is toothpicked into microliter wells. The 

plate is heated to — lOO'C for a sliorl: period to lyse tlie cells 
15 and release the DNA . ' The plate is cooled and reagents for 
cycled amplification of the DNA using, e.g. , PGR technology, 
are added, including primers common to all the cloned 
sequences. See , e.g., Innis et al . (1990) PGR Protocols; A 
Guide to Methods and Applications ^ Academic Press, which is 
20 hereby incorporated herein by reference. The DNA is amplified 
asymmetrically by unbalanced primer concentration to yield an 
excess of one strand for sequencing etnd attached to a substrate 
by manual or automated means. 

An alternative form of automated localization is 
25 described above in positioning of a succession of smaller sized 
polymers which are manually or automaticeilly linked to the 
substrate in a pattern reflecting sequence overlaps. 

(2) Enzymatic polymerization method. 

30 The nucleic acid template is, in some embodiments^ 

attached to the -surface by either the 5» or the 3' end, usually 
by a method as described above. A preferred method of 
attachment is to-anneal the template to an oligonucle tide 
attached to thet surface and to crosslink the template to the 

35 oligonucleotide, oligonucleotide primers are usually 
synthesized chemically. In this case, the immobilized 
oligonucleotide may also serve as a primer for polymerization. 
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Because polymerization proceeds 5* to 3 • on the primer, the 
template will be attached by its 3* end, or a site 3» proximal 
to the region to be sequenced, for the purposes of the 

description to follow. 
5 step 1: A DNA-dependent , DNA polymerase such as 

those used for conventional DNA sequencing, for example, Klenow 
fragment of E.coli DNA Pol, Sequenase (modified T7 DNA 
polymerase), ta^ (TV ^prmns aouaticus ^ DNA polymerase, Bst 
f Bacillus atearothermoDhilus ) . DNA polymerase, reverse 

10 transcriptase (from AMV, MMLV, RSV, etc.) or other DNA 

polymerases, and the reaction components appropriate to the 
particular DNA polymerase selected, are placed in the 
incubation chamber in direct contact with the surface. 

Step 2: Fluorescent chain terminators (analogs of 

15 dATP, dCTP, dGTP, and TP, each labeled with fluorophore 

preferably emitting at a distinguishable wavelength) are added 
to the reaction at a sufficient concentration and under 
suitable reaction conditions (time, temperature, pH, ionic 
species, etc., sge SambrpoJc fitjl- (1989) Wole<?M3.ay glonlnq, 

20 vols. 1-3, and Prober et al .) to cause essentially all of the 
chains on the surface to be extended by one base emd thereby 
terminated. Detection of the specific label thereby 
incorporated into each chain identifies the last base added at 

each positional address in the matrix. 

25 Step 3 : The chain termination should be reversible 

by some means, such as treatment with light, heat, pH, certain 
other chemical or biological (enzymatic) reagents, or some 
combination of these. Typically the chain termination results 
from a blocking moiety which is labile to mild treatment. By 

30 one of these means, the blocJced 3 'OH of the terminating base 
aust be made available for chain extension in the next round oi 

polymerization. 

step 4: There are several sxiiteOsle labeled, 

terminator structures as follows: 
35 (a) The fluorophore itself functions as the chain 

terminator by placement on the 3 ' hydroxyl through a linkage 
that is easily and efficiently cleaved (removing the label and 
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leaving the free 3 "OH) by light, heat, pH shift, etc. The 
surface is scanned with a" scanning system, e.g., the 
fluorescence detection system described in PCT publication no. 
WO90/X507O; and U.S.S.IT. 07/624,120. Then, preferably in a 
5 single step, the fluorophore is removed and the chain is 
activated for the next round of base addition. 

(b) The fluorophore is placed in a position other than the 
3 "OH of the nucleoside, and a different group is placed on the 

3 •OH of the dNTPs to function as a chain terminator. The 
10 fluorophore and the 3 • blocJdng group are removed by the same 
treatment in a single step (preferably) , or they may be removed 
in separate steps. 

(c) An alternative polymer stepwise synthetic strategy can 
be employed. In this embodiment, the fluorophores need not be 

15 removable and may be attached to irreversible chain 

terminators. Examples of such compounds for use in sequencing 
DN& include, but are not limited to, dideoacynucleotide 
triphosphate analogs as described by Prober et^. (1987) 
Science 238:336-341. A second,, unlabeled and reversible, set 

20 of terminators is also required. Examples of these comE>ounds 
are deoxynucleotide triphosphates with small blocking groups 
such as acetyl, tBOC, NBOC and IJVOC on the 3 'OH. These groups 
axe. easily and efficiently removed under conditions of high or 
low pH> exposure to light or heat, etc. After each round of 

25 base addition and detection, the fluorophores are deactivated 
by exposure to light under suitable conditions (these chains 
have their labeling moiety destroyed and remain terminated, 
talcing part in no further reactions) . The unlabeled, 
reversible terminators are unblocked at the 3 'OH by the 

30 appropriate treatment to allow chain extension in subsequent 
rounds of elongation. The proportion of chains labeled in each 
roimd can be controlled by the concentration ratio of 
fluorescent to- non- fluorescent terminators, and the reaction 
can be driven *to completion with high concentrations of the 

35 unlabeled terminators. 

(d) A single dye strategy is used ^ere all the base 
analog terminators carry the same fluorophore and each is are 
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added one at a time: h, C, G, T. The addition of each base is 
followed by scanning detection and labeled. After all four 
fluorophore are added, reversal of the termination is 
performed, allowing for the addition of the nert base analog. 
5 Then, each scanning step determines whether the immediately 
preceding labeled nucleotide had been incorporated at each 
distinct position. 

The structures of the fluorescent labeled and 
reversible terminator base analogs are selected to be 

10 compatible with efficient incorporation into the growing chains 
by the particular DNA polymerase (s) chosen to catalyze 
extension. For example, where two different chain terminators 
are used, they may be utilized by two different polymerases 
that are both present during the chain extension step. 

15 step 5: An optional step is the permanent capping of 

chain extension failxires with high concentrations of 
dideoxynucleotide triphosphates. This step serves to reduce 
the background of fluorescence caused by addition of aiT 
incorrect base because of inefficient chain extension 

20 (termination) at an earlier step. . 

Step 6: After scanning to determine fluorescence, 
the fluorophore is removed or deactivated. Deactivation of the 
fluorophore can be achieved by a phot ode struct ion event. The 
chain elongation block is reversed (usually by removing a 
25 blocking group to expose the 3 •OH) by suitable methods that 

depend on the particular base euialogs chosen; and the substrate 
is washed in preparation for the next round of polymerization. 

Step 7: Repeat the cycle. 

30 B. Chain Deg radation Method 

This -method involves labeling the last base of the 
chain (distal to the surface attachment) with a fluorescent tag 
followed by base-specific removal- All the polynucleotide 
clusters on the matrix are labeled using a standard labeling 

35 moiety- Base-specific removal of the last base of each chain, 
interspersed vith fluorescence scanning of the array, will 
rev al the disappearance of fluorescence and hence the identity 
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of the last base of each chain. When all four labeled end 
bases have been removed, the polymers attached to the matrix 
are relabeled and the process is repeated, working successively 

on the DUa. chains. 
5 Alternatively, if the label allows distinguishing 

between different monomers, simpler, degradation processes may 
be employed. A single scan step can distinguish between all 
four possible terminal nucleotides. The four separate removal 
steps are then combined into a single nonspecific terminal 
10 nucleotide removal step. 

The DNA will usually be attached to the substrate by 
the 3' or 5' terminus depending on the scheme of labeling and 
cleavage- Because there are well-known 5 '-labeling methods, 
see, e.g.. Gait (1984) m i gonucleoti de Synthesis ; A Practical 
15 Approach , IRI* Press, Oxford, this discussion will assume the 3- 
end is attached to the substrate with the 5' end free. 

Step 1: All the 5* -end bases ars- labeled with 
5 '-specific chemistry, e.g., 5' amino linkage to FITC, Nelson 
et al . (1989) ^»r.l . Acids Res. 17:7179-7186, which is hereby 
20 incorporated herein by reference. 

step 2 : Scan the matrix to obtain the background 

level. 

Step 3: optional: Cap all of the labeling failures, 
e.g., polymers whose ends were not labeled. 

25 ^ step 4: The terminal A's are removed with end-base, 

A-specif ic reagents (such a reagent may be chemical or 
biological), one example is a 5 • -f luorescein-dAMP-specif ic 
exonuclease made as a catalytic antibody (see the description 
above for a scheme of producing this, reagent) - 

30 Step 5: Scan the matrix to detect those chains that 

had terminated in A (these will be reduced in fluorescence 
compared to the fluorescent labeled background) . 

step 6: Repeat steps 4 and 5 for each of other three 
possible bases using the appropriate fluorescein-base-specific 

35 cleavage reagent and scan after removal of each of the Cs, the 
G's, and the T's. This succession of steps will allow the 
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determination of the terminal nucleotide of each positionally 

defined cluster. 

Step 7: Relabel the 5* terminal nucleotide of all the 
new end bases that have been eicposed by the earlier rounds of 
5 cleavage, and repeat the stepwise removal and scanning 
processes. 

This approach can be extended to protein sequencing 
using 20 catalytic antibodies (or other amino acid-specific 

cleavage reagents) , each recognizing a terminal amino acid and 
10 removing that terminal rfesidue. 

The process for sequencing laay be summarized as 
follows for enzymatic polymerization: 

1) Target DNA templates (to be sequenced) are 
attached at positionally defined locations on the matrix 

15 substrate. 

2) Fluorescent chain terminators are added to a 
primer under conditions where all polymer chains are terminated 
after addition of the "next base complementary to the template. 

3) The matrix is scanned to determine vrtiich base was 
20 added to each location. This step correlates the added base 

with a position on the matrix. 

4) Chains failing to extend (and therefore to 

terminate) are capped. 

5) The fluorophores are removed or deactivated. 

25 6) The terminators are activated for further chain 

extension, usually by removal of a blocking group. 

7) steps 2 through 6 eire repeated to obtain the 
base-by-base sequence of many different positionally separated 
DNA fragments simultaneously. 

30 

C, 5^cy^-enina fn^- new ny ril Aotide analoq/PolVTBer^se 
^nrph-t nations. 

The use of a functional combination of bl eked 
nucleotide with a polymerase is ijmportant in the synthetic 
35 embodiment of the present invention. It is . important to ensure 
that only a single nucleotide is incorporated at the 
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•appropriate step. The following protocol describes how to 
screen for a functional combination. 

Test 1. (test for polymerase inhibition) 

5 

In a reaction voltane of 20 nl, mix 

1 ug M13mpl9 single stranded DNA template 
2.5 ng standard M13 primer (17-mer : 
10 5 » -GTTTTCCCAGTCACGAC-3 ' 

60 mM tris-Cl pH 8.5 
7.5 mM MgC12 
75 mM NaCl 

15 

Template and primer are annealed by heating to 95* C, 
then cooling to -25' 

Extension components are added: 

20 

50 (each) dATP, dCTP, dGTP, TTP 
10 MCi P32 dATP 

0»01 /iH to 1 mM of the putative terminator compound, 
25 further titrations may be desired, 

20 "units AM7 reverse transcriptase 
water to 20 ^1 final volume^ 



30 



The reaction is run at 42* for about 30 minutes. 
Aliguots are talcien at 10, 20, 30 minutes, and saa^les 
are TCA precipitated after the addition of 10 ^g tRNA 
carrier. 

The filters are counted for acid-precipitable 
35 radioactivity and the mass of dATP incorporated is 

calculated as a function of reaction time. 
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Control reactions aire run in parallel consisting of 

A) no added terminator 

B) 10 and 100 

The termination activity of the experimental samples 
relative to that of ddNTPs is estimated, and a nucleotide is 
appropriate for further testing if it substantially decreases 

the number of acid precipitable counts at any time or relative 
concentration. 



10 



Test 2 (test for base specific termination activity) 

Reactions are run essentially as described by Prober 
15 et al, except; 

1. Unlabelled primer is used 

2. 1 fiCi P32 dATP is included 

3. No dideoxyNTPs are added to the experimental 

20 samples (control reactions containing ddNTP 

at the usual concentrations, and no test 
terminators are run in parallel) 

4. The test compound is added at a 
concentration estimated to give i% and 10% 

25 inhibition of incorporation as determined by 

test #1. 

The reactions are run for 10 min at 42*. 
c^OO /m dNTPS are added and the reaction run for 

30 tfan additional 10 min. A portion of the reaction 

-gfeis prepared and run on a sequencing gel in the 
Msual fashion. The ladders obtained with, the 
tctest compound are coiapared with that obtained in 
a^ihe ddNTP reactions and the fidelity of the 

35 termination activity of the test compound is 

^thereby assessed. 
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IV. ' Apparatus 

The present invention provides a new use for an 
apparatus comprising a reaction chamber and a scanning 
apparatus which can scan a substrate material exposed to the 
5 chamber. . Figure 11 illustrates a system and a schematized 
reaction chamber to which is attached a silicon or glass 
substrate. The system has a detection system 102 as 
illustrated, in one embodiment, in Fig. 7. A silicon substrate 
104, is attached against and forming a seal to maJce a reaction 
10 chamber 106. Leading into and out of the chamber are tubes 
108, with valves 110 which control the entry and exit of 
reagents 112 which are involved in the stepwise reactions. The 
chamber is held at a constant temperature by a temperature 
block 114. 

15 All publications and patent applications are herein 

incorporated by reference to the same extent as if each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 
The invention now being fully described, it will be apparent to 

20 one of or^atry sJcill in the art that many changes and 

modifications can be made thereto without departing from the 
spirit or scope of the claims. 
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WHAT- T5 CIAIMED TS: 

1- A method for determining the positions of 
polymers which terminate with a given monomer, where said 
5 polymers are attached to a surface having a plurality of 
positionally distinct polymers attached thereto, said method 
comprising the steps of; 

labeling a terminal monomer in a monomer type 
specific manner; and 
10 scanning said stirface, thereby determining the 

positions of said label. 

2. A method of Claim 1, wherein said polymers are 
polynucleotides. 

15 

3. A method of Claim 1, wherein the labeling of the 
terminal marker comprises incorporation of a labeled terminal 
monomer selected from the group of nucleotides consisting of 
adenine, cytidine, guanidine and thymidine. 

20 

4. A method for concurrently determining which- 
subset of a pliirality of positionally distinct polymers 
attached to a solid substrate at separable locations terminates 
with a given terminal subunit, said method comprising the steps 

25 of: 

mixing said solid substrate with a solution 
comprising a reagent, which selectively marlcs positionally 
distinct polymers which terminate with said given terminal 
subunit; cind 

30 : determining with a detector which separable 

locations are -marked, thereby determining which subset of said 
positionally distinct polymers terminated with said given 

terminal subunit. 



35 5* A method of Claim 4, wherein said solution 

comprises a reagent which marScs said positionally distinct 
polymer with a fluorescent label moiety. 
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6. A method of Claim 4 , wherein said terminal 
subunit is selected from the group consisting of adenosine, 
cytosine, guanosine, and thymine. 

5 7. A method for determining which subset of a 

plurality of primer polynucleotides have a predetermined 
oligonucleotide, wherein the polynucleotides are complementary 
to distinctly positioned template strands which are attached to 
a solid substrate, said method con^rising the steps of: 
20 selectively mar)cing said subset of primer 

polynucleotides having the predetermined oligonucleotide? and 

detecting which polynucleotides are marked, 

8. A method of Claim 7, wherein said oligonucleotide 
15 subunit is a single nucleotide. 

9. A method of Claim 8, wherein the marking 
comprises elongating said primer with a labeled nucleotide 
which is complementary to a template. 

20 

10. A method of Claim 1, wherein the marking step 
uses a polymerase and a blocked and labeled adenine. 

11. A method for concurrently obtaining sequence 
25 information on a plurality of polynucleotides by use of a 

single label detector, said method comprising the steps of; 

attaching a plurality of positionally distinct 

• polynucleotides to a solid substrate at separable locations; 

labeling said plxirality of polynucleotides with a 
30 terminal nucleotide specific reagent, said label being 
detectable using said label detector; 

determining whether said specific labeling 
reagent has labeled each separable- location. 

35 12. A method of Claim 11, herein said labeling is 

performed with reagents which can distinguishably label 
alternative possible nucleotide monomers. 
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* 13. A method of Claim 11, wherein at least four 
replica siibstrates are produced, each of which is labeled with 
a specific labeling reagent for adenine^ cytosine, guanine, or 
thymine . 

5 

14. A method of Claim 11, wherein said letbeling and 
determining steps are performed in succession using reagents 
specific for each of adenine, cytosine, guanine, and thymine 
monomers. 

10 

15. A method for concurrently obtaining sequence 
information on a plurality of polynucleotides, said method 
comprising the steps of: 

attaching distinct polynucleotides to a plurality 
15 of distinct solid substrates; 

labeling said plurality of solid substrates with 
a terminal nucleotide specific labeling reagent? and 

determining whether said specifTc labeling 
reagent has labeled each distinct substrate. 

20 

16. A method of Claim 15, wherein said labeling, step 
is performed in a continuous flow of distinct solid substrates 
through a reaction solution. 

25 17. A method for simultaneously sequencing a 

plurality of polymers made up of monomer units, said' plurality 
of polymers attached to a substrate at definable positions, 
said method comprising the steps of : 

mixing said substrate with a reagent which 
30 specifically recognizes a termineil monomer, thereby providing 
identification among veirious terminal monomer units; and 

scanning said substrate to distinguish signals at 
definable positions on said substrate; and 

correlating said signals at defined positions on 
35 said substrate to provide sequential series of sequence 
determinations . 
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18- A method of Claim 17, wherein said plurality of 
polymers are synteesized by a plurality of separate cell 
colonies • 

5 19. A method of Claim XI, wherein said plurality of 

polymers are attached to said substrate is by a carbonyl 
linkage. 

20- A method of Claim 17, wherein said polymers ajre 
10 polynucleotides. 

21. A method of Claim 17, wherein said substrate 
comprises silicon. 

15 22. A method of Claim 17, wherein said scanning 

identifies a fluorescent label. 

23. A method of Claim 17, wherein said reagent 
exhibits specificity of removal of terminal monomers. 

20 . 

24. A method of Claim 17, wherein said reagent . 

exhibits specificity of labeling of terminal monomers. - 

25. A method for sequencing a plurality of^ distinctly 
25 positioned polynucleotides attached to a solid substrate 

comprising the steps of: 

hybridizing complementary primers to said 

plurality of polynucleotides; 

elongating a con^lementary primer hybridized to a 

30 polynucleotide by adding a single nucleotide; and 

identifying ^diich of said complementary primers 
have incorporated said nucleotide. 

2€. A meth d of Claim 25> wherein said elongating 
35 step is performed simultaneously on said plurality f 
polynucleotides linXed to said substrate. 
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27. A ineth d of Claim 25, wherein said substrate is i 
two dimensional surf ace and said identifying results from a 
positional determination of said complementary primers 
incorporating said single defined nucleotide. 

5 

28. A method of Claim 25, wherein said substrate is a 
silicon substrate. 

29. A method of Claim 25, wherein said linking is by 
10 photocrossl inking said polynucleotide to said complementary 

primer, where said primer is attached to said substrate. 

30. A method of Claim 25, wherein said elongating is 
catalyzed by a DNA dependent polymerase. 

15 

31. A method of Claim 25, wherein said single defined 
nucleotide has a removable blocking moiety to prevent further 
elongation.-' 

2° 32, A method of Claim 31, wherein said blocking 

moiety is NVOC. 

33- A method of Claim 25, wherein said single 
nucleotide has both a blocking moiety and labeling moiety. 

25 
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